SLEAS: Supervised Learning using Entropy as Attribute Selection Measure
نویسنده
چکیده
There is embryonic importance in scaling up the broadly used decision tree learning algorithms to huge datasets. Even though abundant diverse methodologies have been proposed, a fast tree growing algorithm without substantial decrease in accuracy and substantial increase in space complexity is essential to a greater extent. This paper aims at improving the performance of the SLIQ (Supervised Learning in Quest) decision tree algorithm for classification in data mining. In the present research, we adopted entropy as attribute selection measure, which overcomes the problems facing with Gini Index. Classification accuracy of the proposed supervised learning using entropy as attribute selection measure (SLEAS) algorithm is compared with the existing SLIQ algorithm using twelve datasets taken from UCI Machine Learning Repository, and the results yields that the SLEAS outperforms when compared with SLIQ decision tree. Further, error rate is also computed and the results clearly show that the SLEAS algorithm is giving less error rate when compared with SLIQ decision tree. Keyword-Classification, Data Mining, Decision Tree, Entropy, Gini Index, SLIQ, SLEAS.
منابع مشابه
Spatial Entropy Based Mutual Information in Hyperspectral Band Selection for Supervised Classification
Hyperspectral band image selection is a fundamental problem for hyperspectral remote sensing data processing. Accepting its importance, several information-based band selection methods have been proposed, which apply Shannon entropy to measure image information. However, the Shannon entropy is not accurate in measuring image information since it neglects the spatial distribution of pixels and i...
متن کاملExtended MULTIMOORA method based on Shannon entropy weight for materials selection
Selection of appropriate material is a crucial step in engineering design and manufacturing process. Without a systematic technique, many useful engineering materials may be ignored for selection. The category of multiple attribute decision-making (MADM) methods is an effective set of structured techniques. Having uncomplicated assumptions and mathematics, the MULTIMOORA method as an MADM appro...
متن کاملA Framework for Optimal Attribute Evaluation and Selection in Hesitant Fuzzy Environment Based on Enhanced Ordered Weighted Entropy Approach for Medical Dataset
Background: In this paper, a generic hesitant fuzzy set (HFS) model for clustering various ECG beats according to weights of attributes is proposed. A comprehensive review of the electrocardiogram signal classification and segmentation methodologies indicates that algorithms which are able to effectively handle the nonstationary and uncertainty of the signals should be used for ECG analysis. Ex...
متن کاملQuery Selection via Weighted Entropy in Graph-Based Semi-supervised Classification
There has recently been a large effort in using unlabeled data in conjunction with labeled data in machine learning. Semi-supervised learning and active learning are two well-known techniques that exploit the unlabeled data in the learning process. In this work, the active learning is used to query a label for an unlabeled data on top of a semisupervised classifier. This work focuses on the que...
متن کاملNew Entropy Based Distance for Training Set Selection in Debt Portfolio Valuation
Choosing a proper training set for machine learning tasks is of great importance in complex domain problems. In the paper a new distance measure for training set selection is presented and thoroughly discussed. The distance between two datasets is computed using variance of entropy in groups obtained after clustering. The approach is validated using real domain datasets from debt portfolio valu...
متن کامل